Journals
  Publication Years
  Keywords
Search within results Open Search
Please wait a minute...
For Selected: Toggle Thumbnails
Personal title and career attributes extraction based on distant supervision and pattern matching
YU Dong, LIU Chunhua, TIAN Yue
Journal of Computer Applications    2016, 36 (2): 455-459.   DOI: 10.11772/j.issn.1001-9081.2016.02.0455
Abstract609)      PDF (1000KB)(915)       Save
Focusing on the issue of extracting title and career attributes from unstructured text for specific person, an distant supervision and pattern matching based method was proposed. Features of personal attributes were described from two aspects of string pattern and dependency pattern. Title and career attributes were extracted by two stages. At first, both distant supervision and human annotated knowledge were used to build high coverage pattern base to discover and extract a candidate attribute set. Then the literal connections among multiple attributes and dependency relations between the specific person and candidate attributes were used to design a filtering rule set. Test on CLP-2014 PAE share task shows that the F-score of the proposed method reaches 55.37%, which is significantly higher than the best result of the evaluation ( F-measure 34.38%), and it also outperforms the method based on supervised Conditional Random Field (CRF) sequence tagging method with F-measure of 43.79%. The experimental results show that by carrying out a filter process, the proposed method can mine and extract title and career attributes from unstructured document with a high coverage rate.
Reference | Related Articles | Metrics
Anomaly detection based on improved negative selection matching algorithm
XIAO Xiao-li,TIAN Yue-hong,CHEN Chuan
Journal of Computer Applications    2005, 25 (02): 383-385.   DOI: 10.3724/SP.J.1087.2005.0383
Abstract989)      PDF (121KB)(903)       Save

 A matching algorithm based on the negative selection for anomaly detection was presented in this paper. In the algorithm the effects of position between two temporal sequence to matching degree were considered. So it could distinguish accurately self and non-self, and reduced the size of detective set effectively. Using normal sequence calls, the initial detective set was created, and the detective set was extended by learning, according to the proportion of anomaly temporal sequence to judge whether this sequence was anomaly. Finally, the results of experiment was given.

Related Articles | Metrics